NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Retraining with Predicted Hard Labels Provably Increases Model Accuracy

Das, Rudrajit; Dhillon, Inderjit_S; Epasto, Alessandro; Javanmard, Adel; Mao, Jieming; Mirrokni, Vahab; Sanghavi, Sujay; Zhong, Peilin (May 2025, https://doi.org/10.48550/arXiv.2406.11206)

Training with noisy labels often yields suboptimal performance, but retraining a model with its own predicted hard labels (binary 1/0 outputs) has been empirically shown to improve accuracy. This paper provides the first theoretical characterization of this phenomenon. In the setting of linearly separable binary classification with randomly corrupted labels, the authors prove that retraining can indeed improve the population accuracy compared to initial training with noisy labels. Retraining also has practical implications for local label differential privacy (DP), where models are trained with noisy labels. The authors propose consensus-based retraining, where retraining is done selectively on samples for which the predicted label matches the given noisy label. This approach significantly improves DP training accuracy at no additional privacy cost. For example, training ResNet-18 on CIFAR-100 with ε = 3 label DP achieves over 6% accuracy improvement with consensus-based retraining.
more » « less
Free, publicly-accessible full text available May 7, 2026
Prediction sets for high-dimensional mixture of experts models

https://doi.org/10.1093/jrsssb/qkae117

Javanmard, Adel; Shao, Simeng; Bien, Jacob (January 2025, Journal of the Royal Statistical Society Series B: Statistical Methodology)

Abstract Large datasets make it possible to build predictive models that can capture heterogenous relationships between the response variable and features. The mixture of high-dimensional linear experts model posits that observations come from a mixture of high-dimensional linear regression models, where the mixture weights are themselves feature-dependent. In this article, we show how to construct valid prediction sets for an ℓ1-penalized mixture of experts model in the high-dimensional setting. We make use of a debiasing procedure to account for the bias induced by the penalization and propose a novel strategy for combining intervals to form a prediction set with coverage guarantees in the mixture setting. Synthetic examples and an application to the prediction of critical temperatures of superconducting materials show our method to have reliable practical performance.
more » « less
Controlling the False Split Rate in Tree-Based Aggregation

https://doi.org/10.1080/01621459.2024.2376285

Shao, Simeng; Bien, Jacob; Javanmard, Adel (September 2024, Journal of the American Statistical Association)

Full Text Available
PriorBoost: An Adaptive Algorithm for Learning from Aggregate Responses

Javanmard, Adel; Fahrbach, Matthew; Mirrokni, Vahab (July 2024, Proceedings of the 41st International Conference on Machine Learning, PMLR, 2024.)

Full Text Available
PriorBoost: An Adaptive Algorithm for Learning from Aggregate Responses

Javanmard, Adel; Fahrbach, Matthew; Mirrokni, Vahab (July 2024, Proceedings of the 41st International Conference on Machine Learning, PMLR, 2024.)

Full Text Available
Optimistic Rates for Learning from Label Proportions

Li, Gene; Chen, Lin; Javanmard, Adel; Mirrokni, Vahab (July 2024, Proceedings of Thirty Seventh Conference On Learning Theory (COLT), PMLR, 2024.)

Full Text Available
Optimistic Rates for Learning from Label Proportions

Li, Gene; Chen, Lin; Javanmard, Adel; Mirrokni, Vahab (July 2024, Proceedings of Thirty Seventh Conference on Learning Theory, PMLR, 2024.)
Agrawal, Shipra; Roth, Aaron (Ed.)
Full Text Available
Adversarial Robustness for Latent Models: Revisiting the Robust-Standard Accuracies Tradeoff

https://doi.org/10.1287/opre.2022.0162

Javanmard, Adel; Mehrabi, Mohammad (May 2024, Operations Research)

Low-dimensional structure of data can solve the adversarial robustness-accuracy conflict for machine learning systems. Modern machine learning systems have demonstrated breakthrough performance in a multitude of applications. However, they are known to be highly vulnerable to small perturbations to the input data, known as adversarial attacks. There are many well-documented examples of such behavior, for example small perturbations of an image, which is imperceptible to a human, can significantly degrade performance of modern classifiers. Adversarial training has been put forward as a way to improve robustness of learning algorithms to adversarial attacks. However, this benefit often comes at the cost of decreasing accuracy on natural unperturbed inputs, pointing to a potential conflict between adversarial robustness and standard accuracy. In “Adversarial robustness for latent models: Revisiting the robust-standard accuracies tradeoff,” Adel Javanmard and Mohammad Mehrabi develop a theory to show that when the data enjoys low-dimensional structure, then it is possible to train models that are nearly optimal with respect to both, the standard and robust accuracies.
more » « less
Full Text Available
The curse of overparametrization in adversarial training: Precise analysis of robust generalization for random features regression

https://doi.org/10.1214/24-AOS2353

Hassani, Hamed; Javanmard, Adel (April 2024, The Annals of Statistics)

Full Text Available
Learning from Aggregate responses: Instance Level versus Bag Level Loss Functions

Javanmard, Adel; Chen, Lin; Mirrokni, Vahab; Badanidiyuru, Ashwinkumar; Fu, Gang (May 2024, The Twelfth International Conference on Learning Representations (ICLR), 2024)

Full Text Available

« Prev Next »

Search for: All records